GPU-SME-kNN: Scalable and memory efficient kNN and lazy learning using GPUs
نویسندگان
چکیده
The k nearest neighbor (kNN) rule is one of the most used techniques in data mining and pattern recognition due to its simplicity and low identification error. However, the computational effort it requires is directly related to the dataset sizes, hence delivering a poor performance on large datasets. ::: The :::: use :: of :::::::: graphics processing units (GPU) ::: has :::::::: improved :::: the :::::::: run-time : performance of the kNN rule but the computational requirements of current ::::::::: approaches ::::: limit ::: this : performance as the dataset size increases. In this paper : , : we propose a new scalable and memory efficient design for a GPU-based kNN rule, called GPU-SME-kNN, that ::::: breaks :::: the ::::::::::: dependency between dataset size and memory footprint while delivering high performance. An experimental study of GPU-SME-kNN is presented showing a high performance, even in cases that other :::::::: methods : cannot address, ::::: while ::: the :::::::::::::: computational ::::::::::: requirements :::: are : suitable for most commercial GPU devices. Our design has also been applied to kNN-based lazy learning algorithms reducing run-times in a significant way.
منابع مشابه
GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs
BACKGROUND The analysis of biological networks has become a major challenge due to the recent development of high-throughput techniques that are rapidly producing very large data sets. The exploding volumes of biological data are craving for extreme computational power and special computing facilities (i.e. super-computers). An inexpensive solution, such as General Purpose computation based on ...
متن کاملkNN-Borůvka-GPU: A Fast and Scalable MST Construction from kNN Graphs on GPU
Computation of the minimum spanning tree (MST) is a common task in numerous fields of research, such as pattern recognition, computer vision, network design (telephone, electrical, hydraulic, cable TV, computer, road networks etc.), VLSI layout, to name a few. However, for a large-scale dataset when the graphs are complete, classical MST computation algorithms become unsuitable on general purpo...
متن کاملStudy of K Nearest Neighbour Applications in Image Processing with Graphics Processing Unit
The GPU has always been a processor with ample computational resources. It is a powerful graphics engine and a highly parallel programmable processor having better efficiency and high speed that overshadows CPU. It is used in high performance computing system. The implementation of GPU can be done with CUDA C. Due to its highly parallel structure it is used in a number of real time applications...
متن کاملKNN Model-Based Approach in Classification
The k-Nearest-Neighbours (kNN) is a simple but effective method for classification. The major drawbacks with respect to kNN are (1) its low efficiency being a lazy learning method prohibits it in many applications such as dynamic web mining for a large repository, and (2) its dependency on the selection of a “good value” for k. In this paper, we propose a novel kNN type method for classificatio...
متن کاملA GPU-based Method for Computing Eigenvector Centrality of Gene-expression Networks
In this paper, we present a fast and scalable method for computing eigenvector centrality using graphics processing units (GPUs). The method is designed to compute the centrality on gene-expression networks, where the network is pre-constructed in the form of kNN graphs from DNA microarray data sets.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Inf. Sci.
دوره 373 شماره
صفحات -
تاریخ انتشار 2016